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Abstract. The selection of random subspaces plays a role in quantum information theory analogous to the role of random 
strings in classical information theory. Recent applications have included protocols achieving the quantum channel capacity 
and methods for extending superdense coding from bits to qubits. In addition, random subspaces have proved useful for 
studying the structure of bipartite and multipartite entanglement. 



INTRODUCTION 

In quantum information theory, we're fond of saying that Hilbert space is a big place, the implication being that there's 
room for the unexpected to occur. You will receive no counterarguments to that homespun wisdom from me for the 
simple reason that, as far as I can tell, it's correct. I'm going to present a number of results in quantum information 
theory that stem from the initially counterintuitive geometry of high-dimensional vector spaces, where subspaces with 
highly extremal properties are the norm rather than the exception. Peter Shor has shown, for example, that randomly 
selected subspaces can be used to send quantum information through a noisy quantum channel at the highest possible 
rate, that is, the quantum channel capacity 1 1]. More recently, Debbie Leung, Andreas Winter and I demonstrated that 
a randomly chosen subspace of a bipartite quantum system will likely contain nothing but nearly maximally entangled 
states, even if the subspace is nearly as large as the original system in qubit terms 1 2]. This observation has implications 
for communication, especially superdense coding, as described in 1 3]. Those results and the intuition behind them will 
be my focus here. 



SURPRISES IN HIGH DIMENSION 

Suppose, for the moment, that you are an astronaut orbiting Earth in a space shuttle. Imagine, slightly less plausibly, 
that you are a also mathonaut, meaning that you observe not our Earth but instead a highly idealized version of it 
in which the population is evenly distributed over the whole surface of the planet. Bored with the daily routine of 
gyroscope failures and rebreather malfunctions, you decide to look out the window and count the number of people 
living within a ten kilometer band of the equator. (You have both a good telescope and lots of time on your hands.) Give 
or take a few, you find five million people, with the rest of the population of six billion living elsewhere. Now, bold 
mathonaut that you are, you repeat your observations in higher and higher dimensions, first counting the inhabitants of 
a ten kilometer thickening about the equator of a 3-sphere version of the Earth (in four dimensions), then of a 4-sphere 
and so on up. Long before you reach the 50-sphere, however, you discover a great time saver: count the people outside 
of the band. There aren't any. Perplexed, you decide to check if your luck was bad by selecting other equators for the 
50-sphere, but each time you find that every single inhabitant of the planet lives within ten kilometers of the equator. 

What's going on? Nothing too sophisticated, it turns out. The calculation itself is completely elementary, an exercise 
in spherical coordinates, but the effect is an example of the broader "concentration of measure" phenomenon: naturally 
defined random variables on high-dimensional spaces tend to concentrate strongly around their average values |4]. The 
most familiar example of this is probably the case of the sum of n independent, bounded random variables. According 
to Chernoff's bound, the probability that the sum deviates more than e from its mean value is less than exp(— Cne 2 ) 



for some positive constant C. The analogous statement for functions on the fc-sphere is known as Levy's Lemma: 

Levy's Lemma (See H/, Appendix IV, and [4]) Let /:§*—> R be a function with Lipschitz constant r\ (with respect 
to the Euclidean norm) and a point X 6 be chosen uniformly at random. Then 

Pr{/(X)-/^ ±a) <exp(-C(£-l)a 2 /r] 2 ) (1) 

for some constant C > 0. 

Here / is used to denote either the mean value or a median for /; the median is actually a more natural quantity in the 
theory of concentration of measure. The function relevant to our mathonaut investigations is simply f(x\ , . . . ,x n ) =xu 
which obviously has Lipschitz constant one and both mean and median of zero. 



RANDOM STATES AND RANDOM SUBSPACES 

Quantum states, of course, are represented as unit vectors, so Levy's Lemma provides a ready-made tool for exploring 
the properties of random quantum states in high-dimensional systems. We need only choose the function / and plug 
in its mean value. 

The example that will occupy us is the entanglement of a bipartite system. Let |0) be a random pure state in 
C dA ®C dB , chosen according to the unitarily invariant measure, which in turn corresponds to the uniform measure 
on the (2d A ds — l)-sphere. Assuming without loss of generality that d A < dg, the expected value of the entropy of 
entanglement is known I6t I7ll8ll9l. il Oil and satisfies 

EE((j>)=ES((l> A )>log 2 d A -^^-, (2) 

where S is the von Neumann entropy. Since any state of this bipartite system can have no more than log 2 d A ebits of 
entanglement, this tells us that on average the entanglement is within one ebit of being maximal. Levy's Lemma allows 
us to quantify how likely it is that the entanglement of a random state will fall significantly below the mean. Define 
P = E2 3b • O nce a ^ tne calculations are done, we get the following bound: 

Pr{5(^ A ) < \og 2 d A a- 13} < exp (- &*L=1^ \ , (3) 

for some C > provided ds> d A > 3. Ignoring the small (loga^) 2 factor in the denominator of the exponent, this is 
the same type of exponential convergence to the mean that occurs for population evenly distributed on the fc-sphere. 

The convergence is so rapid, in fact, that it is possible to strengthen these results about random states into statements 
about random subspaces. The idea is to fix a subspace Sq of dimension s and choose a very fine net of states ,Aq C So, 
so fine that given any state \ <j>) €Sq, there is an approximating \ <j>) £ jVq such that ||0 — 0|| i < £. If we choose a random 
unitary U according to the Haar measure, it takes Sq to a random subspace USq and it takes the net JVq to a net U,Aq 
for the new subspace. The probability that a given state in U jY§ has entanglement less than \og 2 d A — a — f5 is given 
by Equation while the probability that any one of them has entanglement less than log 2 d A — a — j3 is then bounded 
above by 

(d A ds - l)Ca 2 
(log«U) 2 

As a net on the unit ball of a subspace of real dimension 2s, the size of j¥q will scale as (C/e) 2s for some constant 
C > 0. Proving the existence of a subspace in which all states are highly entangled then becomes a matter of tuning 
the resolution of the net Jf§ and the value of a. We find that when dg>d A >3 and < a < 1, there exists a subspace 
of C dA O C dB of dimension 

Ca 25 

d A d Bj . — yrr > (5) 
(logaU) j J 

where C > is, as always, a constant. From now on, I'll refer to a subspace having this property (for fixed a) as a 
maximally entangled subspace. In qubit terms, in a bipartite system of n by n + o(n) qubits, this is a subspace of size 
2n — o(n) qubits in which all of the states of entanglement at least n — o(l) ebits. The maximally entangled subspace 
is nearly as large as the whole system. 



\^M~ yA z::z_ )■ (4) 



For the sake of unfair comparison, we could consider the subspace spanned by any two Bell states of a pair of qubits. 
Any such subspace will not only fail to contain only nearly maximally entangled states, it will always contain some 
product states! 

CONSEQUENCES FOR ENTANGLEMENT MEASURES 

Should we be alarmed? Unconcerned? Unconvinced? One way to place the result in context is to reinterpret it in 
the language of mixed state entanglement measures. Consider the maximally mixed state p on one of the maximally 
entangled subspaces. Because the range of p consists only of these states, any convex decomposition of p into pure 
states will again be into these nearly maximally entangled states. Continuing to use the language of qubits, in an n by 
n + o(n) qubit system, p will have entanglement of formation 

E f (p)=n-o(l), (6) 

which is nearly maximal. On the other hand, as the maximally mixed state on a subspace of 2n — o{n) qubits, p will 
have entropy at least S(p) = 2n — o(n). In fact, the parameters can be tuned such that the quantum mutual information 
satisfies 

S{Pa)+S{Pb)-S(Pab) = O(logn). (7) 

The distillable entanglement of the p is therefore also (9(log«) frill . This leaves a huge gap between the entanglement 
of formation and the entanglement of distillation, the first being almost as large as it can be with the second 
simultaneously nearly as small as it can be. Ignoring issues of the additivity of the entanglement of formation, the 
state p provides an example of a state that is nearly as hard to make as a maximally entangled state and yet is nearly 
useless as a resource. In other words, this p would be an example of a state exhibiting near-maximal irreversibility. 



SUPERDENSE CODING OF QUANTUM STATES 

Another way to place the existence of these maximally entangled subspaces in context is to study their applications to 
communication. Suppose that Bob has in mind a specific state \(j>) on a quantum system S that he would like to send 
to Alice. (Their roles are reversed from the usual convention in order to be consistent with the rest of the paper.) If S 
were a bipartite system C dA <g>C dB and |0) was promised to be maximally entangled, then Bob could take advantage 
of superdense coding 1 12]: Alice and Bob would pre-share a fixed maximally entangled state and in order to send 
\<p), Bob would apply a local unitary transformation U$ before sending his half of the system to Alice. That's fine, of 
course, but the promise that |^>) be maximally entangled would seem to make this a very special case, especially since 
Alice ends up with both halves of the bipartite system. Actually, thanks to the existence of a maximally entangled 
subspace, this is essentially the general case. If Alice and Bob pre-share a fixed maximally entangled state and agree 
on an embedding S C C dA <8> C ds of a maximally entangled subspace, then Bob can send to Alice any state | <j>) £ S using 
the simple protocol, up to small errors, since they are all nearly maximally entangled. The qubit accounting then works 
as follows: Bob can send Alice an arbitrary 2n — o(n) qubit state by consuming n ebits of entanglement and sending 
n +o(n) qubits, achieving the two-for-one savings normally associated with sending only classical information. (An 
earlier version of the result achieved the same rates but consumed extra shared random bits lll3ll .') 

The superdense coding idea can be pushed even further, to the case where the state to be prepared in Alice's lab is 
entangled with Bob's system and, therefore, no longer pure. A quick check of the extremal situation suggests that this 
should be easier: if the goal is prepare a fixed maximally entangled state between Alice and Bob's labs, then, provided 
Alice's system is no larger than Bob's, no communication is required at all; Bob need just perform an appropriate 
local unitary on his own system. The interpolation between the two-for-one of pure states and the no communication 
of maximally entangled states is analyzed in 1 3] using techniques similar to those discussed here, with the result that 
the largest Schmidt coefficient A max of all the states to be prepared controls the trade-off. To leading order in the 
asymptotics, j log 2 s + j log 2 A, nax qubits and j log 2 s — 5 log 2 A max ebits are required, s is defined as before to be the 
dimension of the system being prepared on Alice's side alone. 



MULTIPARTITE ENTANGLEMENT 



The results on bipartite entanglement extend easily to the multipartite realm. For convenience, consider a random 
state of n qudits, so that \(j>) G (C d )®" and assume that n is held fixed while d is allowed to increase. The following 
conclusions about random states are essentially corollaries of what we've already seen: 

• The pure state entanglement across every bipartite cut is likely to be near maximal simultaneously. 

• If k > n/2 then the reduced state of any k qudits will likely have near-maximal entanglement of formation. 
Meanwhile, if k < n/2 then it is likely that the entanglement of formation becomes less than any positive constant. 

• With the participation of the remaining n — 2 parties, any pair of parties can distill a nearly maximally entangled 
pure state. 

The last item is at first glance probably the most surprising but no harder to prove than the others. The distillation 
protocol consists of the remaining n — 2 parties each measuring in a random local basis. The state shared by the other 
two conditioned on the outcome of this measurement is essentially random and, therefore, nearly maximally entangled. 



CONCLUSION 

While it is in retrospect no surprise that techniques for dealing with random subspaces should prove useful in quantum 
information theory, the ease with which they can be analyzed was certainly a surprise to me. Random subspace 
techniques have been a mainstay of the "local theory of Banach spaces" ever since Milman |Q gave a proof of 
Dvoretsky's Theorem fl5ll using concentration of measure ideas. It is amusing and perhaps instructive to note that the 
title of a classic book by Milman and Schechtman on the subject, "Asymptotic theory of finite dimensional normed 
spaces," concisely sums up in mathematical terms one of the main goals of quantum information theory. 
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